Sussex Research Online

Modulating signaling networks by CRISPR/Cas9-mediated transposable element insertion

Author: A Bolotin
A Burt
A Hammond
AP Lorenzetti
B McClintock
BA Castilho
C Feschotte
C Feschotte
C Feschotte
C Kettlun
C Lu
C Ye
CN Hancock
DBT Cox
DJ Garfinkel
F Palazzoli
FA Ran
FJM Mojica
G Feng
G Mai
G Yang
H Kuang
H Mao
HT Dong
IK Jordan
J Hou
J Li
J Paszkowski
JA Doudna
JB Owens
JD Hollister
JE DiCarlo
JI Qüesta
JK Baillie
JM Casacuberta
K Kikuchi
K Naito
K Naito
K Shirasawa
KD Kim
KM Esvelt
KS Makarova
L Bortesi
L Cong
LC Li
LM Vaschetto
Luis María Vaschetto
M Cowley
M Jinek
M Momose
M Patel
M Szabo
M Turner
N Buchon
N Jiang
N Jiang
N Jiang
Q Zhang
R Barrangou
R Jansen
RM Walsh
SD Molyneux
SR Wessler
SR Yant
T Mourier
T Sakuma
T Singer
VM Gantz
W Matsunaga
WC Lai
X Li
X Li
X Lin
X Shan
Y Ishino
Y Yan
ZN Adelman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2018

In the recent past, transposable elements (TEs) were regarded as selfish genetic components capable only of copying themselves to increase their odds of being inherited. Yet TEs were initially proposed as positive control elements acting in synergy with the host. It is now well established that TE movement within the host genome constitutes an important evolutionary mechanism capable of increasing adaptive fitness. As insights into TE function accumulate, manipulating transposition has emerged as an interesting way to tune host functions, although the lack of appropriate genome engineering tools has so far hindered it. Fortunately, the emergence of genome editing technologies based on programmable nucleases, and especially the arrival of the multipurpose RNA-guided Cas9 endonuclease system, has made it possible to reconsider this challenge. For this purpose, a particular type of transposon, the miniature inverted-repeat transposable elements (MITEs), shows a series of characteristics that make it attractive for designing functional drivers. Here, recent insights into MITEs and the versatile RNA-guided CRISPR/Cas9 genome engineering system are reviewed to show how the potential of TEs can be deployed to control host transcriptional activity.

Affiliation: Vaschetto, Luis Maria Benjamin. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto de Diversidad y Ecología Animal. Universidad Nacional de Córdoba. Facultad de Ciencias Exactas Físicas y Naturales. Instituto de Diversidad y Ecología Animal; Argentina. Universidad Nacional de Córdoba. Facultad de Ciencias Exactas, Físicas y Naturales. Cátedra de Diversidad Animal I; Argentina

Repository for Publications and Research Data

CONICET Digital

A Genome-Wide Analysis of Promoter-Mediated Phenotypic Noise in Escherichia coli

Gene expression is subject to random perturbations that lead to fluctuations in the rate of protein production. As a consequence, for any given protein, genetically identical organisms living in a constant environment will contain different amounts of that particular protein, resulting in different phenotypes. This phenomenon is known as “phenotypic noise.” In bacterial systems, previous studies have shown that, for specific genes, both transcriptional and translational processes affect phenotypic noise. Here, we focus on how the promoter regions of genes affect noise and ask whether levels of promoter-mediated noise are correlated with genes' functional attributes, using data for over 60% of all promoters in Escherichia coli. We find that essential genes and genes with a high degree of evolutionary conservation have promoters that confer low levels of noise. We also find that the level of noise cannot be attributed to the evolutionary time that different genes have spent in the genome of E. coli. In contrast to previous results in eukaryotes, we find no association between promoter-mediated noise and gene expression plasticity. These results are consistent with the hypothesis that, in bacteria, natural selection can act to reduce gene expression noise and that some of this noise is controlled through the sequence of the promoter region alone.

CiteSeerX

Public Library of Science (PLOS)

Caltech Authors

Enrichment analysis of Alu elements with different spatial chromatin proximity in the human genome

Author: A Antonaki
A Huda
A Nekrutenko
A Smallwood
AF Smit
AM Deaton
C Esnault
C Feschotte
CB Lowe
CT Ong
D Grover
D Grover
D Schmidt
D Xie
E Berezikov
E Lieberman-Aiden
E Wit de
E Yaffe
EP Nora
ES Lander
ES Lander
F Cui
G Bourque
G Kunarso
G Li
G Li
GA Maston
GJ Faulkner
GN Gallus
H Santos-Rosa
H Santos-Rosa
H Xie
HH Kazazian Jr
IK Jordan
J Banerji
J Dekker
J Dostie
J Jurka
J Jurka
J Ule
JA Yoder
JE Hambor
JF Brookfield
JF Brookfield
JM Chen
JR Dixon
JR Korenberg
K Ahn
K Kaer
KC Wang
L Lin
L Teng
M Hackenberg
M Simonis
M Weber
MA Batzer
MG Kidwell
MH Kagey
MJ Fullwood
MM Suzuki
ND Heintzman
NR Smalheiser
P Jin
P Medstrand
P Polak
R Cordaux
R Eskeland
R Lister
R Schneider
R Sorek
RD Hawkins
S Shen
S Winkler
SD Gillies
SL Oei
T Pastor
T Wicker
V Kapitonov
VJ Lynch
WD Gifford
Y Lu
Y Quentin
Y Quentin
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Transposable elements (TEs) have for some time no longer been regarded simply as “junk DNA”, owing to the continual discovery of their multifunctional roles in eukaryotic genomes. Alu, a SINE family and one of the most important and abundant TEs still active in the human genome, has demonstrated indispensable regulatory functions at the sequence level, but its spatial roles remain unclear. Technologies based on 3C (chromosome conformation capture) have revealed the three-dimensional structure of chromatin and make it possible to study distal chromatin interactions in the genome. To find the role TEs play in distal regulation in the human genome, we compiled newly released Hi-C data, TE annotation, histone marker annotations, and genome-wide methylation data for correlation analysis. We found that the density of Alu elements showed a strong positive correlation with the level of chromatin interactions (hESC: r = 0.9, P < 2.2 × 10−16; IMR90 fibroblasts: r = 0.94, P < 2.2 × 10−16) and also a significant positive correlation with some remote functional DNA elements such as enhancers and promoters (Enhancer: hESC: r = 0.997, P = 2.3 × 10−4; IMR90: r = 0.934, P = 2 × 10−2; Promoter: hESC: r = 0.995, P = 3.8 × 10−4; IMR90: r = 0.996, P = 3.2 × 10−4). Further investigation involving GC content and methylation status showed that the GC content of Alu-covered sequences shared a similar pattern with that of the overall sequence, suggesting that Alu elements also function as providers of GC nucleotides and CpG sites. In all, our results suggest that Alu elements may act as an alternative parameter for evaluating Hi-C data, which is confirmed by the correlation analysis of Alu elements and histone markers. Moreover, GC-rich Alu sequences can bring high GC content and methylation flexibility to regions with more distal chromatin contacts, regulating the transcription of tissue-specific genes.
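
The correlation analysis described in this abstract reports Pearson-style r values between Alu density and chromatin interaction level. As a rough illustration only, the sketch below computes a Pearson correlation coefficient in plain Python. The function name `pearson_r` and the per-bin numbers are hypothetical and invented for the example; they are not data or code from the paper, and the abstract does not state which software the authors used.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical values per genomic bin, invented for illustration only.
alu_density = [0.05, 0.10, 0.15, 0.20, 0.25]
contact_level = [1.0, 2.1, 2.9, 4.2, 5.0]
r = pearson_r(alu_density, contact_level)
print(f"r = {r:.3f}")
```

A significance test (the P values quoted above) would need an additional step, for example a t-test on r, which is omitted here.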

University of Bedfordshire Repository

Evolutionary Dynamics of the Ty3/Gypsy LTR Retrotransposons in the Genome of Anopheles gambiae

Ty3/gypsy elements represent one of the most abundant and diverse LTR-retrotransposon (LTRr) groups in the Anopheles gambiae genome, but their evolutionary dynamics have not been explored in detail. Here, we conduct an in silico analysis of the distribution and abundance of the full complement of 1045 copies in the updated AgamP3 assembly. Chromosomal distribution of Ty3/gypsy elements is inversely related to arm length, with densities being greatest on the X, and greater on the short versus long arms of both autosomes. Taking into account the different heterochromatic and euchromatic compartments of the genome, our data suggest that the relative abundance of Ty3/gypsy LTRrs along each chromosome arm is determined mainly by the different proportions of heterochromatin, particularly pericentric heterochromatin, relative to total arm length. Additionally, the breakpoint regions of chromosomal inversion 2La appear to be a haven for LTRrs. These elements are underrepresented more than 7-fold in euchromatin, where 33% of the Ty3/gypsy copies are associated with genes. The euchromatin on chromosome 3R shows a faster turnover rate of Ty3/gypsy elements, characterized by a deficit of proviral sequences and the lowest average sequence divergence of any autosomal region analyzed in this study. This probably reflects a principal role of purifying selection against insertion for the preservation of longer conserved syntenic blocks with adaptive importance located in 3R. Although some Ty3/gypsy LTRrs show evidence of recent activity, an important fraction are inactive remnants of relatively ancient insertions apparently subject to genetic drift. Consistent with these computational predictions, an analysis of the occupancy rate of putatively older insertions in natural populations suggested that the degenerate copies have been fixed across the species range in this mosquito, and also are shared with the sibling species Anopheles arabiensis.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositori Institucional de la Universitat Jaume I

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Repositori Obert UdL

UPF Digital Repository

Diposit Digital de Documents de la UAB

Universidad Nacional De Colombia - Repositorio Institucional UN

FigShare

RUNA - Repositorio de Saúde

Oxford University Research Archive

Cronfa at Swansea University

Red de Bibliotecas Virtuales de Ciencias Sociales de América Latina y El Caribe

Horizon / Pleins textes

Diposit Digital de la Universitat de Barcelona

Consortium of Academic Libraries of Catalonia (CBUC)

DUGiDocs – Universitat de Girona

HAL Descartes

Repositorio da Producao Cientifica e Intelectual da Unicamp

Hal-Diderot

Public Library of Science (PLOS)

ArchiMer - Institutional Archive of Ifremer

HAL-Inserm

포항공과대학교

Okina

Repositorio Institucional da Universidade de Santiago de Compostela

Fondo Bibliográfico Digital Institucional

An overlapping module identification method in protein-protein interaction networks

Author: AJ Enright
B Adamcsek
B Schwikowski
B Titz
C Liu
DL Nelson
G Cui
G Palla
GD Bader
IK Jordan
J Kim
JDJ Han
JF Xia
K Rhrissorrakrai
Lijing Li
MEJ Newman
MEJ Newman
MG Shi
O Kuchaiev
P Shafer
S Asur
S Brohee
S Van Dongen
U Güldener
V Spirin
X Yan
Xuesong Wang
Yuhu Cheng
Publication venue: BioMed Central
Publication date: 01/01/2012

Generating confidence intervals on biological networks

Author: A Wagner
B Lemos
BD Ripley
C Robert
C Tucker
D Drummond
E de Silva
F Picard
G Arfken
H Hermjakob
H Yu
HB Fraser
I Agrafioti
I Xenarios
IK Jordan
J Berg
JS Bader
M Gavin
M Newman
M Stumpf
Michael PH Stumpf
MW Hahn
N Luscombe
N Metropolis
P Bork
R Cho
R Milo
R Milo
T Reguly
Thomas Thorne
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: In the analysis of networks we frequently require the statistical significance of some network statistic, such as measures of similarity for the properties of interacting nodes. The structure of the network may introduce dependencies among the nodes, and it will in general be necessary to account for these dependencies in the statistical analysis. To this end we require some form of null model of the network: generally, rewired replicates of the network are generated which preserve only the degree (number of interactions) of each node. We show that this can fail to capture important features of network structure, and may result in unrealistic significance levels, when potentially confounding additional information is available. Methods: We present a new network resampling null model which takes into account the degree sequence as well as available biological annotations. Using gene ontology information as an illustration, we show how this information can be accounted for in the resampling approach, and the impact such information has on the assessment of statistical significance of correlations and motif abundances in the Saccharomyces cerevisiae protein interaction network. An algorithm, GOcardShuffle, is introduced to allow for the efficient construction of an improved null model for network data. Results: We use the protein interaction network of S. cerevisiae; correlations between the evolutionary rates and expression levels of interacting proteins and their statistical significance were assessed for null models which condition on different aspects of the available data. The novel GOcardShuffle approach results in a null model for annotated network data which appears to describe the properties of real biological networks better. Conclusion: An improved approach to the statistical analysis of biological network data, which conditions on the available biological information, leads to qualitatively different results compared to approaches which ignore such annotations. In particular, we demonstrate that the effects of the biological organization of the network can be sufficient to explain the observed similarity of interacting proteins.
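
The baseline null model mentioned in this abstract, rewired replicates that preserve only the degree of each node, can be sketched with repeated double-edge swaps. The code below is a minimal illustration of that baseline only. It is not GOcardShuffle and does not condition on gene ontology annotations; the function names, the swap count, and the toy edge list are assumptions made for the example.

```python
import random
from collections import Counter


def degree_counts(edges):
    """Count how many edges touch each node."""
    counts = Counter()
    for u, v in edges:
        counts[u] += 1
        counts[v] += 1
    return counts


def rewire_preserving_degree(edges, n_swaps, seed=0):
    """Rewire an undirected simple graph with double-edge swaps.

    Each accepted swap turns edges (a, b), (c, d) into (a, d), (c, b),
    so every node keeps its degree. Swaps that would create a self-loop
    or a duplicate edge are rejected.
    """
    rng = random.Random(seed)
    edge_list = [tuple(e) for e in edges]
    edge_set = {frozenset(e) for e in edge_list}
    if len(edge_list) < 2:
        return edge_list
    done, attempts = 0, 0
    max_attempts = 100 * n_swaps  # guard against graphs with no valid swap
    while done < n_swaps and attempts < max_attempts:
        attempts += 1
        i, j = rng.sample(range(len(edge_list)), 2)
        a, b = edge_list[i]
        c, d = edge_list[j]
        if rng.random() < 0.5:  # pick one of the two possible pairings
            c, d = d, c
        if a == d or c == b:
            continue  # would create a self-loop
        new1, new2 = frozenset((a, d)), frozenset((c, b))
        if new1 in edge_set or new2 in edge_set or new1 == new2:
            continue  # would create a duplicate edge
        edge_set.discard(frozenset((a, b)))
        edge_set.discard(frozenset((c, d)))
        edge_set.update((new1, new2))
        edge_list[i], edge_list[j] = (a, d), (c, b)
        done += 1
    return edge_list


# Toy interaction network, invented for illustration only.
toy_edges = [(1, 2), (2, 3), (3, 4), (4, 5), (5, 6), (6, 1), (1, 4)]
replicate = rewire_preserving_degree(toy_edges, n_swaps=20, seed=1)
print(replicate)
```

A network statistic computed on many such replicates gives a reference distribution; the abstract's point is that this reference can be misleading when annotations are ignored.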

University of Melbourne Institutional Repository

PROMPT: a protein mapping and comparison tool

Author: A Bairoch
A Krogh
AD Neverov
B Boeckmann
CI Castillo-Davis
D Frishman
D Frishman
DA Benson
DH Haft
Dmitrij Frishman
EV Koonin
FC Holstege
G Cochrane
G Gianese
IH Witten
IK Jordan
K Michalickova
KD Pruitt
M Di Giulio
M Gerstein
MJ Kerner
MJ Thompson
ML Riley
P Pagel
P Smialowski
P Wong
R Das
S Ghaemmaghami
SF Altschul
SP Kennedy
T Rattei
Thorsten Schmidt
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Comparison of large protein datasets has become a standard task in bioinformatics. Typically researchers wish to know whether one group of proteins is significantly enriched in certain annotation attributes or sequence properties compared to another group, and whether this enrichment is statistically significant. In order to conduct such comparisons it is often required to integrate molecular sequence data and experimental information from disparate incompatible sources. While many specialized programs exist for comparisons of this kind in individual problem domains, such as expression data analysis, no generic software solution capable of addressing a wide spectrum of routine tasks in comparative proteomics is currently available. RESULTS: PROMPT is a comprehensive bioinformatics software environment which enables the user to compare arbitrary protein sequence sets, revealing statistically significant differences in their annotation features. It allows automatic retrieval and integration of data from a multitude of molecular biological databases as well as from a custom XML format. Similarity-based mapping of sequence IDs makes it possible to link experimental information obtained from different sources despite discrepancies in gene identifiers and minor sequence variation. PROMPT provides a full set of statistical procedures to address the following four use cases: i) comparison of the frequencies of categorical annotations between two sets, ii) enrichment of nominal features in one set with respect to another one, iii) comparison of numeric distributions, and iv) correlation of numeric variables. Analysis results can be visualized in the form of plots and spreadsheets and exported in various formats, including Microsoft Excel. CONCLUSION: PROMPT is a versatile, platform-independent, easily expandable, stand-alone application designed to be a practical workhorse in analysing and mining protein sequences and associated annotation. 
The availability of the Java Application Programming Interface and scripting capabilities on one hand, and the intuitive Graphical User Interface with context-sensitive help system on the other, make it equally accessible to professional bioinformaticians and biologically-oriented users. PROMPT is freely available for academic users from
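
Use case ii) in this abstract, enrichment of a nominal feature in one protein set relative to another, is commonly assessed with a hypergeometric (one-sided Fisher-type) test. The sketch below shows that generic test in Python as an illustration. It is an assumption that a test of this kind is appropriate here; the abstract does not say which procedure PROMPT applies, and `hypergeom_enrichment_p` is a hypothetical name, not part of PROMPT's API.

```python
from math import comb


def hypergeom_enrichment_p(k, n, K, N):
    """One-sided enrichment p-value P(X >= k).

    N: total number of proteins in both sets combined
    K: proteins among the N that carry the annotation
    n: size of the set being tested for enrichment
    k: proteins in that set that carry the annotation
    """
    if not (0 <= K <= N and 0 <= n <= N and 0 <= k <= min(n, K)):
        raise ValueError("inconsistent counts")
    total = comb(N, n)
    # Sum the hypergeometric probabilities for k, k+1, ..., min(n, K).
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1))
    return tail / total


# Hypothetical counts, invented for illustration only.
p_value = hypergeom_enrichment_p(k=8, n=10, K=20, N=100)
print(f"p = {p_value:.3g}")
```

Requires Python 3.8 or later for `math.comb`. When many annotations are tested at once, a multiple-testing correction would normally follow.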

Proteinortho: Detection of (Co-)orthologs in large-scale analysis

Author: A Alexeyenko
A Force
A Nakabachi
A Schneider
AE Hirsh
AJ Enright
C Lanczos
D Cornaz
DM Kristensen
E Pruesse
EV Koonin
IK Jordan
J Hopcroft
JP McCutcheon
L Li
Lydia Steiner
M Fiedler
M Fiedler
M Remm
M Sikdar
Manja Marz
Marcus Lechner
MC Rivera
P Bork
Peter F Stadler
RL Tatusov
S Guattery
SM van Dongen
Sonja J Prohaska
Sven Findeiß
TJ Hubbard
WM Fitch
Z Fu
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools as brute-force approaches with quadratic memory requirements become infeasible in practise. The rapid pace at which new data become available, furthermore, makes it desirable to compute genome-wide orthology relations for a given dataset rather than relying on relations listed in databases. Results: The program Proteinortho described here is a stand-alone tool that is geared towards large datasets and makes use of distributed computing techniques when run on multi-core hardware. It implements an extended version of the reciprocal best alignment heuristic. We apply Proteinortho to compute orthologous proteins in the complete set of all 717 eubacterial genomes available at NCBI at the beginning of 2009. We identified thirty proteins present in 99% of all bacterial proteomes. Conclusions: Proteinortho significantly reduces the required amount of memory for orthology analysis compared to existing tools, allowing such computations to be performed on off-the-shelf hardware.
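
The reciprocal best alignment heuristic named in this abstract can be illustrated in its plain, unextended form: two proteins are paired when each is the other's highest-scoring hit. The sketch below shows only that basic idea. It is not Proteinortho's extended version, and the function name, the input layout (dictionaries keyed by query and hit), and the toy scores are assumptions made for the example.

```python
def reciprocal_best_hits(scores_ab, scores_ba):
    """Return pairs (a, b) where b is a's best hit and a is b's best hit.

    scores_ab maps (protein_in_A, protein_in_B) -> alignment score for
    searches of genome A against genome B; scores_ba is the reverse.
    Ties are broken by whichever hit is encountered first.
    """
    def best_hit_per_query(scores):
        best = {}
        for (query, hit), score in scores.items():
            if query not in best or score > best[query][1]:
                best[query] = (hit, score)
        return {query: hit for query, (hit, _score) in best.items()}

    best_ab = best_hit_per_query(scores_ab)
    best_ba = best_hit_per_query(scores_ba)
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)


# Toy alignment scores, invented for illustration only.
a_vs_b = {("a1", "b1"): 200, ("a1", "b2"): 150, ("a2", "b2"): 180, ("a3", "b2"): 90}
b_vs_a = {("b1", "a1"): 200, ("b2", "a2"): 180, ("b2", "a1"): 150, ("b3", "a3"): 50}
pairs = reciprocal_best_hits(a_vs_b, b_vs_a)
print(pairs)
```

Here `a3` is left unpaired because its best hit `b2` prefers `a2`, which is the behaviour that distinguishes a reciprocal criterion from a one-directional best-hit search.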

Fraunhofer-ePrints